The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms
نویسندگان
چکیده
The analysis of lectures and meetings inside smart rooms has recently attracted much interest in the literature, being the focus of international projects and technology evaluations. A key enabler for progress in this area is the availability of Ambrish Tyagi has contributed to this work during two summer internships with the IBM T.J. Watson Research Center. D. Mostefa (&) N. Moreau K. Choukri Evaluations and Language Resources Distribution Agency (ELDA), 55–57 rue Brillat Savarin, 75013 Paris, France e-mail: [email protected] URL: http://www.elda.org N. Moreau e-mail: [email protected] K. Choukri e-mail: [email protected] G. Potamianos S. M. Chu A. Tyagi IBM T.J. Watson Research Center, Yorktown Heights, NY 10598, USA URL: http://www.ait.gr G. Potamianos e-mail: [email protected] S. M. Chu e-mail: [email protected] Present Address: A. Tyagi Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, USA J. R. Casas J. Turmo Universitat Politècnica de Catalunya, Barcelona, Spain J. R. Casas e-mail: [email protected] 123 Lang Resources & Evaluation (2007) 41:389–407 DOI 10.1007/s10579-007-9054-4
منابع مشابه
The CHIL RT07 Evaluation Data
This paper describes the CHIL 2007 evaluation data set provided for the Rich Transcription 2007 Meeting Recognition Evaluation (RT07) in terms of recording setup, scenario, speaker demagogic and transcription process. The corpus consists of 25 interactive seminars recorded at five different recording sites in Europe and the United States in multi-sensory smart rooms. We compare speakers’ talk-t...
متن کاملA Joint System for Single-Person 2D-Face and 3D-Head Tracking in CHIL Seminars
We present the IBM systems submitted and evaluated within the CLEAR’06 evaluation campaign for the tasks of single person visual 3D tracking (localization) and 2D face tracking on CHIL seminar data. The two systems are significantly inter-connected to justify their presentation within a single paper as a joint vision system for single person 2D-face and 3D-head tracking, suitable for smart room...
متن کاملDetection, diarization, and transcription of far-field lecture speech
Speech processing of lectures recorded inside smart rooms has recently attracted much interest. In particular, the topic has been central to the Rich Transcription (RT) Meeting Recognition Evaluation campaign series, sponsored by NIST, with emphasis placed on benchmarking speech activity detection (SAD), speaker diarization (SPKR), speech-to-text (STT), and speakerattributed STT (SASTT) technol...
متن کاملSome Preliminary Results on Multimodal Recognition of Events in Smart Meeting Rooms
This paper aims to present some novel ideas developed for the work on automatic meeting transcription in the framework of a European Community sponsored research project. The goal of this project is the development of a meeting browser supporting the multimodal analysis of and access to meetings held in smart meeting rooms. Due to this multimodal aspect, speech recognition as well as visual pro...
متن کاملThe IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings
We describe the IBM systems submitted to the NIST RT06s Speech-to-Text (STT) evaluation campaign on the CHIL lecture meeting data for three conditions: Multiple distant microphone (MDM), single distant microphone (SDM), and individual headset microphone (IHM). The system building process is similar to the IBM conversational telephone speech recognition system. However, the best models for the f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Language Resources and Evaluation
دوره 41 شماره
صفحات -
تاریخ انتشار 2007